This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX
/
xc-llm-ascend
Watch
3
Star
0
Fork
0
You've already forked xc-llm-ascend
Code
Issues
Pull Requests
Projects
Releases
Wiki
Activity
Files
v0.18.0
xc-llm-ascend
/
docs
/
source
/
assets
/
quantization
/
quant_method_call_flow.png
67 lines
15 KiB
Plaintext
Raw
Permalink
Normal View
History
Unescape
Escape
[Doc]modify the quantization user guide and add a quantization adaptation developer guide (#5554) ### What this PR does / why we need it? This PR makes the following modifications: 1.delete the `user_guide/feature_guide/quantization-llm-compressor.md` and merge it into `user_guide/feature_guide/quantization.md`. 2.update the content of `user_guide/feature_guide/quantization.md`. 3.add guidance `developer_guide/feature_guide/quantization.md' on the adaptation of quantization algorithms and quantized models. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.13.0 - vLLM main: https://github.com/vllm-project/vllm/commit/7157596103666ee7ccb7008acee8bff8a8ff1731 --------- Signed-off-by: IncSec <1790766300@qq.com> Signed-off-by: InSec <1790766300@qq.com>
2026-01-05 09:12:11 +08:00
RIFFt: